A Simple Method for Chinese Video OCR and Its Application to Question Answering
Abstract
Captions in videos contain valuable information for video retrieval. Although caption text can be obtained easily in newer image compression formats such as MPEG2, many video programs are still encoded in older formats, so video OCR is indispensable for content-based video retrieval. This paper proposes a simple video OCR method for Chinese captions consisting of image capture, caption region detection, background removal, character segmentation, OCR, and post-processing. We used Discovery Channel films as the training and testing corpus. In an outside test, the video OCR accuracy was 84.1%. The experiments ran on a computer with a 1.7 GHz Pentium 4 CPU, 256 MB of RAM, and a 40 GB, 7200 rpm hard disk. On average, processing a 495 MB film took 29 minutes and 11 seconds. We also applied the video OCR results to video retrieval and question answering.
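The abstract names character segmentation as one stage but does not say how it is done. As an illustration only, the sketch below uses a vertical projection profile, a common baseline for splitting a binarized caption strip into character candidates. The technique, the function name `segment_columns`, the `min_width` parameter, and the toy input are all assumptions for this example and are not taken from the paper.

```python
from typing import List, Tuple


def segment_columns(strip: List[List[int]], min_width: int = 1) -> List[Tuple[int, int]]:
    """Split a binarized caption strip into character candidates.

    `strip` is a list of pixel rows, where 1 marks a text pixel and 0 marks
    background. Returns (start, end) column ranges with `end` exclusive.
    This is a hypothetical baseline, not the method described in the paper.
    """
    if not strip:
        return []
    width = len(strip[0])
    # Vertical projection: number of text pixels in each column.
    profile = [sum(row[x] for row in strip) for x in range(width)]

    spans: List[Tuple[int, int]] = []
    start = None
    for x, count in enumerate(profile):
        if count > 0 and start is None:
            start = x  # a run of text columns begins
        elif count == 0 and start is not None:
            if x - start >= min_width:
                spans.append((start, x))  # the run ended at an empty column
            start = None
    # Close a run that reaches the right edge of the strip.
    if start is not None and width - start >= min_width:
        spans.append((start, width))
    return spans


# Toy 3x10 strip with three separated blobs (made-up data).
strip = [
    [0, 1, 1, 0, 0, 1, 0, 0, 1, 1],
    [0, 1, 0, 0, 0, 1, 0, 0, 0, 1],
    [0, 1, 1, 0, 0, 1, 0, 0, 1, 1],
]
print(segment_columns(strip))  # [(1, 3), (5, 6), (8, 10)]
```

Many Chinese characters are built from left and right components separated by a gap, so a purely gap-based split like this one tends to over-segment. A practical system would likely merge adjacent spans using an expected character width before passing candidates to OCR.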
Similar resources
簡易影片字幕文字辨識法及其詢答應用 (A Simple Method for Video OCR and Its Application on Question Answering) [In Chinese]
Optimizing question answering systems by Accelerated Particle Swarm Optimization (APSO)
One of the most important research areas in natural language processing is Question Answering Systems (QASs). Existing search engines, with Google at the top, have many remarkable capabilities. But there is a basic limitation (search engines do not have deduction capability), a capability which a QAS is expected to have. In this perspective, a search engine may be viewed as a semi-mechanized QA...
BVideoQA: Online English/Chinese bilingual video question answering
This article presents a bilingual video question answering (QA) system, namely BVideoQA, which allows users to retrieve Chinese videos through English or Chinese natural language questions. Our method first extracts an optimal one-to-one string pattern matching according to the proposed dense and long N-gram match. On the basis of the matched string patterns, it gives a passage score based on ...
A weighted string pattern matching-based passage ranking algorithm for video question answering
Video question answering aims to pinpoint answers in response to user-specified questions. However, most question answering technologies involve integrating rich, specific external knowledge such as syntactic parsers, which are often unavailable for many languages. In this paper, we present a new string pattern matching-based passage ranking algorithm for extending traditional text Q/A towa...
Investigating Embedded Question Reuse in Question Answering
The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance by reusing information in the answer to one question to answer another related question. Our analysis shows that a pair of questions in general open-domain QA can have an embedding relation through their mentions of noun phrase expressions. We present methods f...
Journal title:
- IJCLCLP
Volume 6, issue
Pages -
Publication date: 2001